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ABSTRACT 

The rationale for and development of the Classroom 
Behavior Scales (CRB) are discussed in connection with applications 
on four separate samples of Head Start, kindergarten and first grade 
children. Studies of observer agreement, stability, structure, 
comparisons over sex, classes, conditions, observers and over time 
are presented. Results showed high observer agreement, 
non-significant repeated trail correlations, consistent and expected 
changes in scale means over time — Head Start to first grade, unstable 
factor structures over samples and over repeated observation of the 
same sample. Differences between classes were consistently 
significant. An application with an experimental program did not show 
significant differences, which was an expected result. Classroom 
observational strategies are compared to educational and 
psychological testing as ways of getting measures of children in 
schools. While the latter will, in general, put a premium on 
stability, as a condition of test reliability, the former will 
emphasize individual variability. Data from CRB applications is 
presented to support this contention. The key for the scales of the 
CRE is appended. {Author) 
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. ABSTRACT 

The rationale for and development of the Classroom Behavior 
Scales ( CRB) are discussed in connection with applications on four 
separate samples of Head Start, kindergarten and first grade children. 
Studies of observer agreement, stability, s t ructu re , compar i sons over 
sex, classes, conditions, observers and over time are presented. 

Results showed high observer agreement, non-significant repeated trial 
correlations, consistent and expected changes in scale means over time -- 
Head Start to first grade, unstable factor structures over samples and 
over repeated observation of the same sample. Differences between 
classes were consistently significant. An application with an experi- 
mental program did not show significant differences, which was an expected 
resul t . 

Classroom observational strategies are compared to educational 
and psychologi cal testing as ways of getting measures of children in 
school s, . While the latter will, in general , put a premi urn on s tab i 1 i ty , 
as a condition of test reliability, the former wi 1 1 emphasize individual 
variability. Data from CRB applications is presented to support this 
contention. Furthermore, tests are developed with '‘success" in terms 
which mean successful individuals. This is clearly a form of institu- 
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tional bias which perpetuates existing conditions of potential and 
opportunity. Attention to classroom process wi } ] set up a more direct 
connection between intervention and effect than exists in traditional 
designs which tend to ignore or control for instructional variables. 

If goals derived from process oriented studies favor certain groups 
it will be more apparent than when those goals are obscured by the 
mysteries of test development. 

Efforts in the direction of classroom behavior measurement 
call for different research methodologies, designs, measurement techniques, 
budgets and technology. Since they must necessarily be closely tied to 
teaching and learning in schools, they will have a more immediate effect 
on educational practices* and they will more immediately be effected by 
them. Therefore, process research will be involved in settings and will 
necessarily be directly associated with change. 
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1 . I n troduc t ion 



Although there have been many efforts to both describe and study the 
correlates of attitudes (Bowers, Davis, and Bowers, 1 962 ; Travers and others, 
1961), styles (Hughes, 1 959 ) 3 methods (Hayes, 1966), characteristics (Ryans, 

1 96 O ; He i 1 and Washburne, 1962; Shim, 1965), and specific behaviors (Kounin 
and Gump, 1961) of teachers, there have been few efforts to focus on individual 
child behavior in classroom situations (Perkins, 1964, 1965)* The latter calls 
for an extensive observational program over an extended period of time, as 
well as clearly stipulated dimensions which can apply to diverse situations, 
activities, and grade levels. Those investigators who have attended to 
classroom dynamics have general Iv limited themselves to highly specific 
behaviors such as language (Bellack and others, 1966) or kinds and amounts of 
social interactions (Amidon, 1966; Flanders, 1965). Because these variables 
are so specific to particular kinds of classes, ages of children, and subject 
matter, they are difficult to generalize across grade levels, subject matters, 
styles of classroom organization or school systems. They are even more diffi- 
cult to apply as part of a training model. 

In order to explore an attentive approach to observing, recording, and 
describing “educational” behavior of children, we have focused on productivity 
of children in classroom situations, which includes any and all parts of the 
school day, such as field trips and times when children are eating, resting, 
or in transition. 

It is assumed that the impact of an educational experience on a child 
will be a function of the child’s productivity in a succession of class days. 
Assessment of productivity involves judgments about what is taking place as 
well as the intensity of experiences for each child. 

The measurement of intelligence involves linearly combining scores on 
a variety of items, each of which implies judgments about selection and 
criteria for success, but none of which is essential to a “high 11 score, 

The concept of intelligence, as operationalized in intelligence tests, is 
useful partly because it does not depend on highly specific abilities or 
reactions. Similarly, successful productivity can take place in a variety 
of ways and in many kinds of situations. 

O 
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While productivity is here conceived of as a dependent variable 



dependent upon teacher, other children, classroom atmosphere, interactional 
patterns, and curriculum -- it is possible to raise questions about patterns 



teachers and between different school environments. Our preliminary work 
and our review of research in this field indicate that there is no simple 
connection between pupil performance and either teacher attitudes, or the 
behaviors of teachers in classroom situations, which have come under the 
headings of style, methodology, or instructional modes (Cogan, 1959; Harvey, 
1967)* The variability of child behaviors in different classroom situations 
and with different teachers is more or less accepted, but the implications 
of this have rarely affected research or practice (Cronbach and Snow, 19&9) • 
There is much talk about individual differences of both teachers and children, 
and there is glib acceptance of the axiom that particular classrooms might be 
better for some children than for others, but placement of children in 
classrooms is generally immutable. This research has specifically addressed 
itself to questions about variance and invariance of child behavior over 
time and in diverse educational settings. 



of productivity in individual children across time, within classes, between 
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I I . Rationale 



In any given time segment, a child's classroom behavior can be charac- 
terized as being more or less involved and at a higher or lower level of ; 

functioning. It is assumed that a classroom that generates higher levels of ; 
productivity for longer periods of time will have greater educational impact j 
on its members. Although there is considerable evidence to the contrary, it ■ 

i 

remains to be seen whether methods, materials, or teaching style bear any J 
direct relationship to productivity of individual children in any given } 

sample of classes (Cogan, 1959; Harvey, 1967). [ 

Quantification of productivity involves measurement of the extent of ; 
involvement in particular activities in which children are engaged. Ascer~ \ 

taining levels of productivity involves evaluation of the process in which j 

children are engaged, and is not dependent upon curriculum -- specific | 

materials being used, subject matter being studied or method being applied -- j 

age of children, size of class, presence of teacher or other children, or 
whether or not it is a prescribed or proscribed activity. * 

Any child behavior can be looked at both in terms of form and content. 

We can ask questions about the structure of an interaction, a play activity., j 
a lesson, or a lunch period. We can also inquire about the content of any of 
these activities. In order to have sufficient generality, a scale which is ; 

to be applied to such diverse phenomena must be sufficiently abstract so that: 

it is not dependent upon specific characteristics of any one kind of form or f 
content . 

As a basic model for these scales, we have drawn heavily upon Guilford's 
“dimensions of intellect* 1 (Guilford, 1957), particularly those that he calls 
“products'* and “operations." Form that is relatively mechanical and which 
depends upon authoritative transactions is considered to be low level. Forms; 
which are reciprocal and which depend upon experimentation, dialogue, and 
accommodation are considered to be of high level. Content that consists 
principally of labels, definitions, and memorization, whether it be colors 
and shapes or intellectual history, is of low level. On the other hand, 
content that involves transformations over time, space or people, implications 
or analyses is considered high level-.- 



In both form and content there is a developmental progression from 
simple, unitary and one way processes to more differentiated, reciprocal 
and abstract processes. The application of the concept of productivity to 
educational process will require constant operational redefinition of dimen- 
sions of form and content in order to establish the unitary nature of the 
concept with regard to diverse educational situations* 

In addition to ascertaining amount and quality of productivity that 
takes place in classroom situations, it is necessary to determine conditions 
under which it takes place, both in order to determine whether or not produc- 
tivity is independent of other factors, and to give it a more valid meaning 
by placing it in its proper context. Conditions include : control -- 

who initiates activities and who is responsible for them being carried out; 
size of group -- including other children and teachers, direction and kinds 
of social interactions that are transpiring, and curriculum that is manifest 
at the time. 

Thus, in any given time segment, individual children in classes are 
Involved in processes which can be observed, and resulting recordings can 
be made which indicate levels of productivity. Furthermore, we can judge 
the extent of child involvement, continuity of the extent and kind of involve- 
ment over time, and the relative variation of children’s productivity in any 
particular class, over time. We can observe and describe the conditions 
under which this productivity takes place in order to provide measurements of 
variables which will describe both individual child and class behavior in 
such a way as to bear a direct relationship to the assessment of educational 
impact. This descriptive process is not dependent upon teaching styles, 
methods, or materials, but can be systematically related to them in order to 
throw light on questions about appropriate placements for children in given 
educational situations. 

Productivity is here conceived of as a dependent variable dependent 
upon the quality of contact that a teacher makes with a child individually or 
with a group of children, and the interaction of that contact with what 
individual children and teachers bring to classroom situations. It can also be 
conceived of as an independent variable which contributes to classroom 



behaviors of children in ensuing classroom and ou t-of-schoo 1 situations, 
and to cognitive, social, and emotional growth which can be measured either 
by the use of tests or through additional observational procedures. 

Our goal is to be able to discuss atmospheres which are associated 
with greater and lesser productivity, but which are not amenable to any 
straight forward categorization, such as structured versus non-s t rue tured 
or democratic versus authoritarian. It may be that the only valid way to 
present data on atmosphere is through films. 

There are several important assumptions behind this rationale which 
have affected the choice of an observational strategy and the content of that 
strategy. Foremost is the position that children can change their patterns 
of behavior in such a way as to drastically affect their ability to succeed 
in school and also to change standards of success. These changes cannot 
come about by the thoughtless application of an intellectually attractive 
methodology, the institution of a nation-wide program for poor children or 
the imposition of prescriptions which consider neither the teachers that use 
them or the children who are supposed to benefit from them. Changes in 
children will not be accomplished without careful attention to the educational 
process in which they are involved-. The “little black box“ approach, which 
includes a program which is presumed to operate in the same way on all 
children, accompanied by a series of pre and post tests, will inevitably lead 
us to the conclusion that children cannot change, although that is hardly the 
intended hypothesis that is being tested in most “black box" interventional 
studies. The real hypothesis that has been tested over and over again is 
whether any given interventional technique has a uniform effect on all 
children, which is, incidentally, confounded with the hypothesis that all 
teachers involved in an interventional program will administer it in the same 
way. The results of research on these two hypotheses are unequivocal (Barr, 
1961; He i 1 and Washburne, 1962; Ryans, I960; Shulman, 1970; Stephens, 1967; 
Turner, 1965)* In addition to indicating that variations from interventional 
procedures are trivial compared to variation between teachers and individual 
children, they suggest that educational evaluation must concern itself with 
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variation within classrooms and within children rather than between programs, 
methodologies, and prescribed styles. 

Therefore, in order to study and effect change, it will be necessary to 
address ourselves to those behaviors and situations that contain meaningful 
components of variability and which will provide a more direct and valid test 
of the hypothesis of change and of the potential for change. 

A second critical assumption has to do with the relationship between 
data collected and agents who are directly involved with these data. We 
would question the collection of classroom data which are not or cannot- be 
shared with teachers and which are designed and developed without their 
intimate and sustained involvement — * not because of a moral question but 
because of a scientific one. Teachers are links in the chain of change and 
are crucial to both its study and its accomplishment. To ignore them by 
creating instruments which are irrelevant to them or by designing research 
which simply uses them as objects of study will almost c-uarantee that they 
will not change and, consequently, that children in their classes will not 
change. Therefore, it is assumed that the rationale on procedures must be 
within the realm of understanding and application of teachers, not necessarily 
that they agree with it. 
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111. Procedures 



Des i gn 



The variations and correlates of productivity have been examined across 
the following dimensions of independent variation: 
grade level 

experimental class versus regular class placement 

observers 

classes 

time 

The specific crossing of these dimensions has depended upon the character- 
istics of educational organization that was involved in each project. 

The selecting of variables to be used as correlates of productivity 
has depended upon testing programs in the schools and agencies in which we 
worked, which included the national Head Start evaluation testing program. 

Reliability of the procedure (observer agreement) has been systematically 
studied by having two observers rate the same child during the same time 
interval. The study' of observer agreement has been built into each separate 
study because of changing observer personnel and the diversity of educational 
situations that have been used. 

Instrument development has been an important part of the overall design. 
Variations rn time segments, scheduling, observers, sampling of activities and 
variable definition have been used in order to further examine possibilities 
of the procedure. 

Specific data on productivity has been obtained over the aforementioned 
conditions in order to provide a measure of the potential impact of an 
educational program on individual children and groups of children. 

The procedures for assessing productivity have been applied to four 
samples of preschool (Head Start) and first grade children. Each application 
was aimed both at instrument development and at obtaining data which was to 
be used to more adequately describe variations in impact across children, 
teachers, and activities. In Brockton, Massachusetts, productivity of 56 
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Head Start children was assessed in seven classes where children had been 
randomly placed into classes, and teachers randomly assigned* Productivity 
recordings were again made on these same children during the spring and, 
again, during the fall of their first grade of public school in order to ex- 
amine behavioral consistency of children in different classroom situations 
and over a period of a year* 

A second application, using productivity as a principal dimension, 
involved an evaluation of eleven Follow-Through control and experimental 
classes in the Cambridge, Massachusetts, public schools (66 Children*) 

This involved kindergarten and first grades in experimental and control 
classes with four observers and six observational periods* 

Another use of the CRB was with Head Start classes in and around 
Boston, in order to examine variability of productivity and observer agree- 
ment . 

A fourth application explained the correlates of productivity for 76 
children in a testing program which included psychometric evaluation, parent 
interviews, sociometric evaluation and achievement tests* 

This process of assessing productivity has been used on a limited 
basis in six classes for emotionally disturbed children, each of which was 
visited extensively by an experienced observer, in order to assess the 
validity of the instrument to get at classroom variations which had been 
assessed independently (Merrill, 1 96 )• 

Resu 1 t 

These studies have shown that productivity, as will be operationally 
defined below, is a function of classroom environment; that there are consis- 
tent between-cl ass differences in productivity; that observer agreement is 
such as to warrant the use of group data in evaluation studies (observer 
agreement correlations range between .60 and . 92 ); and that productivity appears 
to be a unitary dimension which cuts across activities, ages, and teacher 
methodologies (or styles)* Future applications of the process wi 1 1 be more 
specifically concerned with intellectually and emotionally disabled children 
who are in regular and special class settings, with particular emphasis on 



preschool and primary grade level children, and evaluating productivity of 
children in classes which have been independently judged as being “high 
potential impact .“ 

I ns trumen ta t ion 

Classroom Behavior Scales (CRB): 

Scales have been developed to be used in classrooms as well as in 
classroom activities that take place outside of the physical environs of the 
classroom, in order to assess amount, kind, and conditions of productivity 
displayed by children of ages three to twelve (preschool and elementary.) * 
This strategy focuses on child behavior in any and all situations. Ten- 
minute intervals are selected on a time sampling basis and records are made 
every thirty seconds. Fourteen scales are used to describe productivity of 
individual children. Scales and categories into which they fit are as 
fol lows : 

J. amount of productivity 

a. participation: rejection to involvement 

2. kind of productivity 

a. process focus — form: authoritarian to experimental 

b. process focus content: mechanical to transformation 

c. use of materials: irrelevant to unique 

d. curricular activities 

3. conditions of productivity 

a. control -- overt: teacher to child directed 

b. control -- covert: teacher to child directed 

c. behavior: withdrawn to hyperactive 

d. interaction: agents types and modes 

e. group size of specific activity being observed. 

Observer agreement correlation (two observers recording on the same child at 
the same time) have varied between .60 and .92. Low correlations were found 
where there was restricted range in the use of the scales. 

The CRB is a rating process rather than a rating scale. It involves 
an extensive training period for observers and it assumes a continuing flow 
of communication between observers and teachers. Ratings that are made are 
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routinely duplicated, given to teachers, and discussed so that a feedback 
process is always going on, depending upon the interest and willingness of 
the teachers involved. 

Observers are required to have observed a class long enough so that 
they know the names of all children before any recording begins. 

A series of ten-minute training films have been developed, each of 
which focused on an individual child in a setting similar to the one used. 

In addition, simulta-eous video taping and CRB recording have been done on 
a selected number of cases in order to provide a continuing check of observer 
re 1 i ab i 1 i ty . 

The key for the CRB, as well as the form used by observers, is included 
as an appendix. 

Video Taping 

Once we have determined who are the most productive children and classes, 
it becomes important to say something about the teaching which produces this 
productivity. We are dissatisfied with categories that have been used and are 
being used because they do not appear to hold up. Furthermore, we doubt that 
we can create useful categories at the present time because we have not 
arrived at a useful theoretical formulation which connects the various 
components of the teaching process. There are two poss i b i 1 i t ies for arriving 
at a more suitable independent variable which would be theoretically related 
to productivity and which would be useful to teachers, supervisors, trainers, 
and researchers, in gaining a more useful and greater understanding of the 
teaching process: (l) differentiation of teacher personalities (Jackson, 

1963; Bjerstedt, 1967; Zimiles, Rabinowitz, and Hoy, 1964), (2) relation of 
variation in classroom atmosphere to patterns and amounts of child productiv- 
ity (Sears, 1 963 ) • Both of these alternatives are beset by rather over- 
whelming measurement problems, as is well illustrated in the literature. 

Given the difficulties of measurement, the strategy for developing a 
data base for relevant input variation must necessarily be fairly crude 
and must depend more upon the recipient than the giver. One approach to this 
is to develop carefully selected video tapes of classrooms that differ 



markedly in the kinds and amounts of productivity that children display. These 
video tapes then would become basic data to be presented, along with data on 



In addition, they can be used in order to train observers, provide a more 
complete feedback system for teachers and stimulate the formulation of hypo- 
theses regarding the aforementioned two alternatives- 

Sampl e 

The numbers of children, classes, and disability areas used in these 
studies has depended upon negotiations with several public and private schoel 
systems. The goal has been to sample special and regular classes of pre- 
school and elementary age children, including children who have been diagnosed 
as mentally retarded, emotionally disturbed, and learning disabled. Head 
Start classes have been sampled, including those in which disabled children 
have been placed and those in which they have not been placed.. 



productivity of individual children and of groups of children (Cogan, 1963)* 
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IV. RESULTS, CONCLUSIONS AND SPECULATIONS 



Observer Agreement 

In connection with two applications, observer agreement was explicitly 
studied. Two observers completed ten“minute recordings for the same child, 
at the time. The team of observers was systematically rotated so that all 
observers were paired with all others. The first agreement study (1968) 
used six observers, making fifty matches (fifty children, u hundred protocols). 
Each CRB scale was analyzed separately. Correlations ranged from .23 to .92 
(between first and second observers) with a median of .60. Scales with low 
correlations had restricted ranges of scoring responses -- on a five-point 
scale, only two or three points were used. The wording of these scales was 
revised in order to encourage use of the total range. In 1969*. fifty matches 
were used with the revised scales. Correlations ranged between .60 and .92 
wi th a med i an of .75 . 

Such levels of agreement were reached only after extensive training 
of observers, using video tapes and films, and careful study and delimitation • 
of the scales. Although we were interested in idiosyncratic observer responses, 
we were also trying to develop a common language that would be useful for 
discussing the behavior of children in and out of classrooms. A rigid and 
excessive value put on observer agreement can eliminate perceptions which 
are critical to becoming aware of differences between children, classrooms 
and teachers. Disagreement between observers may mean that a given scale 
means different things to each, but it may also mean that they are seeing 
the situation in different ways and that both responses are precise, Further, 
it may be more important that the research reflects both views, than that it 
either eliminates one or compromises both in the name of observer agreement 
reliability. In subsequent research we will deal more directly with the 
relationship between styles of observation and behavioral styles of children 
and teachers. It may be that observer variations (rather than disagreement) 
will be a fertile source for the study of potentials for change. We would 
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expect that there is an important interaction between the observer and the 
behavior being observed which is, generally, either experimentally or sta- 
tistically, controlled or cancelled out. It is often assumed, in the name 
of reliability, that observer agreement is synonymous with precision and 
that observer disagreement is either the result of ambiguous wording or 
poorly defined concepts. It is our contention that there is an important 
connection between observer disagreement and the potentials for change of 
teachers and children. 

It is necessary here to distinguish between two types of disagreement: 

(1) Disagreement is undesirable when discrepancies are attributed to 
misunderstanding of concepts and operations that are used in par- 
ticular scales; 

(2) Disagreement is desirable when there is basic understanding of 
the concept and operation of particular scales, but behavior in 
the classroom is seen in different ways because of important 
personality and value differences between observers. 

An understanding of this second group of differences is critical to further 

instrument development where the principal focus is on how observational 

data can be used as a basis for feedback to teachers. 

Although we did not get very far with this strategy in CRB studies, 
we did have one opportunity to apply the second kind of observer disagree- 
ment to a feedback model. We had two observers in the Brockton classrooms, 
both with small portable, battery-operated tape recorders. While they were 
in the classrooms, they continuously spoke in the recorders about what 
children were doing, what teachers were doing and what the general atmos- 
phere of the classroom was like. Both of these observers had considerable 
experience teaching preschool children as well as children with intellectual 
and emotional disabilities, and both had specific training in observation. 

The tapes of their observations were transcribed. Each of these observers 
had his own purpose for recording observations, but a general project purpose 
was to provide teachers with feedback. Therefore, we encouraged teachers to 
read the transcripts of both observers who were in their classrooms. In this 
way, they were able to become directly and immediately aware of how different 
observers viewed what they were doing, the different language that was used 
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to describe similar behavior of the same children and teachers, and two 
different attitudes about the presence or absence of structure in their 
classrooms. This was an extraordinary demonstration of observer disagree- 
ment of the second kind, which teachers were able to accept. It resulted 
in extensive dialogue over a period of time with both observers, with parents 
and with their supervisors. We think that this kind of procedure should be 
formalized so that a narrow view of observer agreement synonymous with 
reliability will not sterilize observational data. 

Stabi 1 i ty 

After a fair degree of observer agreement was established, wo began 
asking questions about the stability of each scale and of the total CRB 
structure. How consistent are individual children and class groups over 
time? Is stability affected by time interval, classroom, observer or age? 

Does the mu 1 t i -var i a te structure of the CRB remain constant over time, and 
for different samples? 

Although it was not possible to set up designs to get unequivocal or 
even comfortable answers to these questions, we were able to deal with them 
in different ways. With the Cambridge Follow-Through sample (N = 66), 
three sets of observations were made on two separate days -- a total of six 
ten-minute recordings for each child. Means from each recording were cor- 
related with means from other recordings for each scale. This resulted in 
6X6 correlation matrices for each scale. The correlations for each scale 
within each day were uniformly positive and between .25 and .50. Correlations 
between days were uniformly non-significant (less than .25) with about equal 
numbers of positive and negative correlations. The part i c i pa t i on , behav i or 
and control scales were quite consistent — they were also the scales that 
had the highest correlations of observer agreement. The process focus 
scales were somewhat more erratic. The i nteract i ona I scales were extremely 
erratic and were extensively revised for later applications, as a result of 
these data. 
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The Brockton sample was followed for a full year, with CRB scales 
being used to collect data in the summer (during Head Start), fall (first 
grade) and spring (first grade) on the same group of children (N = 5°) * 

Six observations (two days) were made during the summer, nine (three days) 
in the fall and three (one day) in the spring. Observers had to achieve 
high standards of proficiency before they began formal observation. 

Correlations for each scale, for summer x fall, fall x spring and 
summer x spring, were near zero (non-significant). CRB measurements did 
not reveal consistency of classroom behaviors of individual children between 
these three periods. They did reveal a fairly high degree of internal con- 
sistency for each of the scales. 

The repeated testings of the Brockton sample over a one year period 
provided data on group stability. Mean scores of the sample in Head Start 
are consistently and significantly different from mean scores of the sample 
in first gr ad e . 

Table One 

Means and Standard Deviations for CRB 
Brockton Sample: Summer, Fall and Spring 



Sea 1 e 


Parameter 


Summer 
N = 56 


Fal 1 
N = 52 


Spr i ng 
N « 50 


Con tro 1 


Mean 


3-51 


2.60 


2.35 




St. Deviation 


• / 


-27 


• 56 


Process Focus: 


Mean 


1.79 


2.27 


2.34 


Form 


St. Deviation 


• 35 


.34 


.40 


Process Focus: 


Mean 


1 -73 


3-49 


3-57 


Content 


St. Deviation 


• 34 


• 30 


.27 


Part i c i pa t i on 


Mean 


3.68 


3-10 


3.06 




S t . Dev i a t i on 


■ 33 


.17 


.23 


Behav i or 


Mean 


CO 

oo 


2.86 


2.82 




St. Deviation 


• 30 


.21 


.26 
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It can be seen In Table One that control goes from the child, during 
Head Start, to the teacher during both CRB observations in the first grade; 
process focus , form and content both go up in the fall of the first grade 
and stay up in the spring of the first grade; participation gets less and 
behavior becomes more subdued during both first grade observations. The 
differences and directions are understandable in light of extensive anec- 
dotally reported descriptions of Head Start and first grade classrooms in 
Brockton. 

Looking at group differences in the Cambridge Follow-Through data, 
analyses of variance were done for each scale on each of the six trials, 
on the means and ranges of trials 1-3 and 4-6, and on the means and ranges 
of trials 1-6. The results of these analyses and more extensive descrip- 
tion of results and interpretation can be found in the section on CRB var- 
iation below and in Table Five of that section. Classes vjere found to 
differ significantly and consistently on con t ro 1 , process focus-form , par - 
ti c i pa t ion , style , group size and on most of the interaction variables. 

CRB data described classes uniquely, even if stability coefficients over 
individual children were near zero. 

Structure 

The definition of structure for an instrument that does not have 
demonstrable stability over individuals is a precarious undertaking, even 
if group data suggests stable group functions. Correlation matrices from 
each application show extraordinary and erratic variability, wh i ch is con- 
firmed by factor analyses... Factor structure is generally different for each 
sample, the only similarity being the common presence of what can be roughly 
termed as maturity and interaction dimensions. However, these can be inter- 
preted as being artifacts of the two types of items which were in the CRB. 
Even in the respective Brockton matrices, calculated from data on the same 
children, structures are different for summer, fall and spring observations. 
A cursory inspection of correlation matrices shows some with many high r's, 
and others with relatively few. Average correlation coefficients for these 




- 16 - 



23 



matrices range from .13 to .75 with a median of .23. The group of variables 
with the high estimated communalities vary considerably from one analysis to 
the other, both in those analyses using original samples (50 to JG subjects) 
as well as in analyses of combined samples (n's = 152, 202 and 139). 

Factor analyses were done on data from seven separate samples and 
three combinations of these samples. Three of the samples and one combination 
were recorded using the original CRB. The remaining samples and combinations 
were recorded using a revised form, as was discussed in the opening part of 
the previous section. The revised form included several reworded items -r 
in order to extend the range of responses -- and a different way of handling 
social interaction. Information on samples and combinations is presented in 
Table T wo . 



Table Two 

CRB Samples for Factor Analyses 





Year 


N 


a of 

Variables 


CRB 

Form 


Average 

R 


Sum H 


Samp I es 














Cambridge Fol low-Through 
(C FT) 


68 


66 


17 


Original 


.18 


8 . 60 


Boston Head Start ( BHS) 


68 


50 


18 


1 1 


.12 


7-53 


Brockton Summer (BrS) 


68 


56 


17 


1 1 


.24 


10.68 


Brockton Fa 1 1 ( BrF) 


68 


52 


18 


Rev I sed 


• 35 


1 1 .93 


Brockton Spring (BrSp) 


69 


50 


18 


1 1 


.23 


1 1 .66 


Boston South End (SE) 


69 


50 


18 


M 


.54 


14.89 


Cambridge (Camb) 


69 


26 


18 


1 1 


.75 


16.28 


Combined Samples 














BHS S CFT & BrS 


68 


152 


18 


Original 


.13 


7-68 


BrF & BrSp 


69 


202 


19 


Rev i sed 


.19 


11.20 


SE & Camb 


69 


139 


19 


u 


.20 


11.04 
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Estimated communa 1 i t i es were inserted into the main diagonals of 
correlation matrices, all latent roots and factors were extracted, and those 
factors with roots greater than unity were rotated, using a normal varimax 
criterion. Rotations were done stepwise -- first, the two factors with the 
largest roots, then the three factors with the largest roots, etc. 

For the three combined samples, the latent roots greater than unity 
and the percentages of trace accounted for are presented in Table Three; 
normalized communal i t i es and three factor rotations for each scale are 
presented in Table Four. 



Table Three 

Latent Roots of CRB Factor Analyses 



BHS £ 


CFT & BrS 


BrF £ 


; BrSp 


SE & 


Camb 


N 


- 152 


N = 


- 202 


N = 


139 


Latent 


% of 


Latent 


% of 


Latent 


% of 


Root 


T race 


Root 


T race 


. "Root 


T race 


3.03 


40 


3.84 


3*4 


-Cr 

CO 


38 


1.58 


21 


2.67 


2k 


2.53 


23 


1 .21 


J_6 


2.06 


18 


2.15 


20 




77 


1 .29 


12 




81 



1,00 _9 

97 
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Table Four 



Normalized Communa 1 1 t 1 es and Three Factor Rotations 
From Three Combined Samples 



Samp 1 es 



Sea 1 e 


BHS 


6 CFT 


& 


BrS 


BrF 


& BrSp 




Se 6 


Camb 






i 1 


1 


i : i 




I 


II III 




| 


II ) 1 I 


Quan t i ty 






















Par t i c i pa t i on 


.73 


X 






.67 




X 


81 


X 




Qua 1 i ty 






















Process Focus 


.71 


X 






•73 






1X10 


X 




Form 






















Process Focus 


-71 


X 






.73 






1X10 


X 




Content 






















Use of Mat 1 1 s 


• 35 


X 






.49 






58 


X 




Condition 






















Control (Overt) 


.49 


X 






• 94 


X 


X 


■ 62 


X 


X 


Control (Covert) 


na 








.94 


X 


X 


62 


X 


X 


Behavior 


• 73 


X 






.56 




X 


78 


X 




Role of Child 


na 








.80 


X 




81 


X 




Group 






















S i ze 


na 








.46 




X 


•63 




X 


Circle 


na 








.44 






• 51 






No . of 1 1 s 


na 








.46 






• 72 




X X 


Interaction Variables 






















Style or Mode 


.20 








.91 




X 


67 


X 




Style 0 1 s 


• 37 




X 




na 






na 






No Interaction 


• 57 




X 


X 


na 






na 






Chinch Attention 


.48 








na 






na 






CH-ch Verbal 


1 .00 






X 














ch“CH Attention 


• 39 




















ch“CH Verbal 


1 .00 






X 














CH“T Attention 


.46 




X 
















CH“T Verbal 


• 35 




















T“CH Attention 


.23 




X 
















T-CH Verbal 


• 57 








na 






na 






Agents 


na 






1 


.00 


X 


X 


1 .00 




X 


No. of 0‘s 








1 


.00 


X 




• 83 




X 


No . of 1 1 s 








. 1 


.00 


X 




• 83 




X 


No. of 2‘s 










• 79 




X 


1 .00 




X 


Type 










• 91 


X 


X 


• 52 






Appropr i ateness 


na 








.60 






•57 


X 




Notes: x Indicates factor 


load i ngs 


greater 


than 


,4. 











See Appendix for scale descriptions. 

CH — child being observed; ch -- other child; T -- teacher; 
na — not appl i cable . 
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In each of the three analyses, there is a factor that loads on process 
focus , parti c i pa t i on , behav i or and control ; and two factors that load on 
interactional variables plus control . This speaks both to what has to be 
done about the instrument as well as to the internal structure of the CRB. 
There is not enough structural similarity accross samples to justify the 
use of factor loadings in scoring the instrument, or to make a direct con- 
nection between the CRB and inferences about the structure of classroom 
behav i or . 

Variations between samples confound this instrumental error with 
legitimate sampling variations. Included in the former is the observer 
training process and the selection of observers. We have not been able to 
carefully check out this confounding, but it has become clear that there is 
a discrepancy between scale and productivity variability. There was a 
dearth of classroom behavior variability in our samples, except for the 
relatively small South End and Cambridge Head Start samples. This meant 
that differences between observers would have an excessive effect. The 
more similar our samples of classes, the more differences will be due to 
observers, rather than reflections of real differences between classes. 
Similarly, the more heterogeneous the observers, the greater the differences 
between classes will have to be in order to come out ' s i gn i f i can t 1 y in an 
analysis. Obtaining samples of classes and observers which will allow 
between-c 1 ass inferences to be drawn is a critical step in advancing our 
knowledge about educational variation. - 

These analyses show a consistent manifold of observed behaviors that 
includes cognition ( process focus) , motivation ( gar t i c i pat i on) . activity 
level ( behavior) and independence ( control ) . The usual bifurcation between 
cognition (apt i tude-ach i evemen t) and soc i a 1 -emo t iona 1 does not come out. 
Although the analysis of stability does not justify the use of CRB data as 
a measure of individual differences, the eventual possibility is suggested 
by the structure. However, we will probably have to begin conceptualizing 
such differences as i nd i v i dua 1 -wi th i n-a-s i tua t i on differences, which will 
vary both from individual to individual and from situation to situation, 
with an individual X situation interaction. One of the real problems of 
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these analyses is that we were stuck with our sample clusters. It is not at 
all clear that these sample clusters were comparable. The univariate compari- 
sons of means and standard deviations presented in Table One can be easily 
interpreted, with the strong limitation that the interdependence of variables 
is not accounted for. We can say something about control and part ic i pat ion 
of the samples, but we cannot link them to one another, or to other dimensions 
of the space. 

Multivariate comparisons require covariance regularity -- a given 
comparison can be meaningless because, disregarding central tendencies, 
there is restricted variance and, therefore, little or no covariance. The 
nature of classroom atmospheres is likely to lead to this kind of d i 1 emna . 

The constellation of behaviors (CRB) in a classroom is generally either 
related or contingent. The more the teacher talks, the less the children 
talk. A high degree of observable participation will generally he associated 
with fairly high activity levels. This means that the proper description 
of classrooms is necessarily multivariate. But the set of contingencies 
might be quite different from classroom to classroom. We are used to stereo- 
types about teaching in classrooms -- which would make it appear that certain 
teacher characteristics might go together with certain kinds of classrooms. 

But our experience has been that there is no simple connection between the 
personalities and actions of teachers, and the classroom behavior of chil- 
dren. Although univariate connections obviously exist (Shea, 1968), there 
is a serious question about multivariate relationships. The factor analyses 
of. our data cause us considerable concern In this direction. They suggest, 
because of slight structural consistency, fairly high observer agreement, 
but practically no individual child stability, that we have inadvertently 
chosen disparate samples of classes. The rules of the game are so different 
from class to class that the measurement format breaks down at the individual 
child 1 evel . 

For example, we have to decide how many times and how long each time, 
to observe each child. So we begin observing many children in many classes 
and get data which suggests that the number of times a child has to be 
observed, depends on the child and the class, not upon a rule that will 
cover all applications of the instrument. But the number of times and 



0 



- 21 - 



duration are important -- they tell us something else about the child. 
However, if we have to observe some children twice and others ten times, we 
are in a methodological dilemna which we do not know about until after the 
fact., 

The point of this discussion -- or perhaps the apology -- is to artic- 
ulate a basic dilemna of studying classroom behavior of children, as opposed 
to studying psychometric behavior of children or the classroom behaviors of 
teachers. The child functioning in the classroom is not strictly comparable 
to any other child functioning in any other classroom, in the same way that 
he is psychometr ical ly comparable to other children. Classrooms and schools 
would appear to be different from one another in ways that affect child 
classroom functioning. The appropriateness of comparing child functioning 
will depend on the appropriateness of comparing classes. Until we can 
establish multivariate comparability of classes, measures of individual 
differences in classroom behavior will not mean anything and will produce 
the ambiguity that is evident in the factor analyses reported herein. 

Putting this in a slightly different way, child classroom behaviors 
depend on the child and the classroom (teacher, other children, school). 

The first comparisons are those between children in the same class. Next, 
are comparisons between children in classes that have common structure, so 
that there is a meaning to looking for classroom behavior structure that is 
not interminably confounded with between-cl ass variation. The next step 
will be to operationally define the dimensions of class structure. Lastly, 
individual differences in classroom behavior can be measured as a function 
of intervening structure. 

CRB Variation over Sex, Conditions, Classrooms and Observers 

Although we have many questions about the structural stability of the 
CRB, and its use as a measure of individual differences, several secondary 
analyses indicate that group comparisons are reasonable.- This in no way 
validates the CRB, nor does it provide solid evidence for accepting or 
rejecting hypotheses about group differences. Rather, it provides an 
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empirical foundation for the future development of classroom behavior strategy, 
in general, and the CRB, in particular. We have already discussed CRB scale 
means and standard deviations of Brockton children when they were in Head 
Start and first grade (Table One). Changes in means are understandable in 
terms of observer reports. By using CRB data on the same sample over time, 
there is some modest e.lement of control -- at a univariate level, we are 
beginning to tap relatively uncontaminated classroom behavior variation. 

Because of the structure of the Cambridge Follow-Through program, it 
lent itself to secondary analyses of several independent variables. Because 
of arguments presented in the above section on 'structure , 1 univariate 
analyses of variance were used on each scale, for each trial and for three 
combinations of trials (1~3, 4-6, 1 “6) . For the latter, the dependent 
variables included means and ranges. 

Analyses were done across the following dimensions: 
sex 

c lasses ( 1 1 ) 

conditions (experimental X control) 
observers (4) 

Table Five presents obtained 'F l ratios for each CRB scale (original) 
across all dimensions except sex, which was uniformly non-significant. 

Analyses presented include only those done on means of trials 1~3> 4-6 
and 1-6, which accurately represent the results of analyses of separate 
trials. The only exception to this was the analysis of variance for obser- 
vers which is presented only for trials I- 3 and 4-6 -- the means for trials 
1-6 for observers is not applicable because, in all cases, it includes two 
observers. The analyses of variance of ranges do not add anything and, 
therefore, are not presented. 



0 



- 23 “ 



Table Five 



Cambridge Follow-Through: 'F 1 Ratios from CRB Analyses of Variance for each 

Scale across Classes, Conditions and Observers (N = 66) for Trials 1-3, 4-6 
and 1-6 



Dependent Variable Independent Variable 

Cond i t i on (2) 



CRB Scales 


C lasses 


(M) 


Exp 


X Control 


Observers (4) 


Trials 1 “3 


4-6 


1-6 


1-3 


4-6 


1-6 


1-3 


4-6 


Control 


3.36 


4.25 


6.67 


X 


X 


X 


X 


4.99 


Process Focus 
Form 


X 


4.25 


2.98 


X 


X 


X 


X 


X 


Process Focus 
Con ten t 


X 


2.77 


3.20 


X 


X 


X 


X 


X 


Parti ci pat ion 


2.81 


7.43 


5-54 


X 


X 


5.66 


X 


10.28 


Behav i or 


•2.66 


X 


4.04 


4.79 


X 


4.91 


X 


X 


Style 


X 


6.47 


6.79 


X 


X 


X 


4,47 


3.59 


Use of Mat 1 1 


X 


2.74 


4.28 


X 


X 


4.75 


X 


X 


Group Size 
Overa 1 1 


X 


9.19 


10.90 


5.08 


27.53 


21 .27 


X 


X 


Group Size 
Circled 


2.24 


3.80 


7-09 


X 


10.96 


10.72 


X 


4.70 


Chi ld-Chi Id 
1 n teract i on 


2.34 


3.66 


5.69 


X 


X 


X 


5.90 


X 


T eacher-Ch i 1 d 
Interaction 


X 


2.71 


2.22 


X 


X 


X 


X 


3.59 


Verbal 
1 nteraction 


3.34 


2.77 


4.75 


7.66 


X 


X 


7.17 


3.86 


Attention 


5.89 


18.29 


12.08 


7-79 


X 


8.64 


8.87 


25.30 


Subject Dominant 


2.06 


14.62 


13-95 


X 


X 


X 


4.41 


24.41 


Other Dominant 


8.57 


3.06 


3-03 


5-29 


X 


X 


23-96 


6.53 


Curricular Index 


X 


X 


X 


X 


X 


X 


X 


X 


Degrees of Freedom 

Significance Level 
of F 

NOTE: x indicates 


10,55 

p = .05 2.00 

p = .01 2.66 

F less than p = .05 level 


of ! 


l ,64 

3.99 

7.04 

5 i gn i f i cance . 


3 

2 

4 


,62 

.76 

.13 
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It can be seen in Table Five that classroom 'F l s l are consistently 
significant, particularly on the trial 1-6 comparisons. Our expectation 
for a scale of classroom behavior was that it would consistently and signi- 
ficantly distinguish classrooms of non-selected teachers. This expectation 
was fulfilled. 

1 'Cond i t i nns" (Table Five) are generally not significantly different, 
with the exception of group size . As we reviewed that first year of Follow- 
Through, these results were to be expected. It was a new program which 
encountered great difficulties in getting started. In several schools, 
teachers would not willingly volunteer to be experimental teachers. The 
problem of developing a special program within the schools was disturbing 
both personnel and procedures* Our informal (CRB) and informal observations 
agreed -- Follow-Through did not make very much difference in classroom 
behaviors that first year* The differences in group size were straightfor- 
ward effects of more adults and fewer children in experimental classrooms.. 

Differences across observers were most marked in the i n terac t i on 
scales -- but the observers had questioned the format of those particular 
scales and suggested revisions. These were made in a subsequent revision 
of the CRB which was used in later applications. 

The pattern of results reported in Tables One and Five give us some 
support for our contention that the CRB is working at a group level. Such 
a strategy is basic to attending to what goes on in interventions -- the 
ways children actually function with a given teacher, methodology or class. 
Too much educational research has ignored classroom behavior and teacher 
effects while studying more accessible phenomena and characteristics. There 
has been much use of psychometrics, teacher questionnaires and sociometrics 
combined with the use of independent variables which were usually neither 
independent nor systematically variable. 

Correlates of CRB Scales: Tests, School Grades 

For the Brockton and South End-Cambr i dge samples, other data were 
available for children who had been observed and recorded with CRB protocols. 
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In the development of the CRB scales, there was no Intention of validating 
them on psychometric or school success criteria. They were developed indepen- 
dently of these potential criteria for very definite reasons and biases -- 
educational and psychological tests and school grades are built on a structure 
of stability which is associated with reliability and social class. Measure- 
ment of schoo 1 -re 1 a ted behaviors (as opposed to school behaviors) is an 
institutionalized way of keeping everyone on a steady course. Such measures 
have been validated by correlating them with the traditional success of 
successful people. When we think seriously about educational change, which 
is an imperative for lower income populations, we also have to ponder whether 
measurement procedures can either adapt to, or lead, the process. If schools 
are to be settings where change will be likely, we must be concerned with ways 
of altering existing behavioral patterns and of being more aware of the 
specific effects of alternative structures and personnel. Tests will cer- 
tainly not fulfill this function. They can only be responsive to more 
efficient procedures of accomplishing traditional goals. But traditional 
goals and their accomplishment leave an important segment of school popula- 
tions out of the running. Statistics on how many children are out of school 
and who they are, are very difficult to come by, but it is rather clear and 
accepted that many children drop out, particularly those from lower income 
minority groups who live either in inner cities, or in rural settings and who 
are often associated with migrant occupations. Without belaboring the point, 
the development of the CRB scales did not attend to such external criteria 
as tests and school grades. However, we thought it would be informative to 
look at correlations of CRB scales (or factor scores) with a number of tests 
and school grades. 

For the Brockton sample, we had the following measures in addition to 

CRB's: 

Lee Clark Reading Readiness Test 
California Test of Mental Maturity 
Peabody Picture Vocabulary Test 
Average school grades during first grade 
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Correlations between these. four measures and CRB scales were calculated 
for summer Head Start (1968), fall first grade ( 1 968) and spring first grade 
(1969). Because all of the correlations were near zero, multiple correla- 
tions were calculated using psychometric measures and average grades as 
independent variables and each scale of the CRB as dependent variables. 
Multiple ‘R l s l were all very low and ‘F 1 tests for residuals were non-signi- 
ficant with just one exception. CRB data was, therefore, independent of tests 
and school grades. 

CRB factor scores were calculated for the Cambridge-South End data 
(N = 76) and correlated to the pre and post tests of the 1 968-69 National 
Head Start Evaluation Program. The measures used in that evaluation were 
as foil ows : 



S tanford-B i net Intelligence Scale 

Animal House subtest of the Wechsler Preschool and Primary 
Scale of Intelligence (WPPSl) 

Preschool Inventory (Caldwell and Soule, 19 ) 

Gumpgookies (Adkins and Ball if, 1 967 ) 

Sociometric Scale (Boger, 1969) 



Factor scores were obtained from factors described in Table Four, 
above, with full knowledge of the tenuousness of the procedure. Correlations 
between factor scores and all of the above instruments were near zero (not 
significantly different from zero), which was the same result as that ob- 
tained from the Brockton data. 

In spite of the fact that the development of the CRB did not consider 
aptitude and achievement tests, it was certainly surprising that no evidence 
for correlation was found in two separate samples, one a largely black 
sample from an inner city area, the other a white sample from a mi 1 1 town 
about forty miles from Boston. The lack of stability for individuals may 
be all or part of the explanation for this. The cognitive measures, on the 
other hand, were fairly stable on test-retest -- S tanf ord-B i ne t Intelligence 
Scale, R = .68; WPPSl Animal House, R = .57; Preschool Inventory, R = .62. 

It is unlikely that the unstable CRB scales, with non-significant repeated 
measure correlations, would correlate significantly with other measures when 
they do not correlate with themselves. 
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Answers and Questions 



Although our data provide no definitive evidence for the lack of re- 
lationship between classroom behavior and tests, it is worthy of some spec- 
ulative discussion. Test theory and test construction have developed along 
well identified paths. They set up a behavioral situation, measure within 
it, and draw inferences for test results that ignore (or transcend) that 
situation. There is no question that test measures have been successful at 
going beyond test situations. From as little as a 15 _ minute group testing 
situation, a score can be obtained that will be highly predictive of how 
successful an individual will be in school -- which amounts to thousands of 
hours of behavior. While it is true that predictive power of testing is 
restricted to scholastic aptitude, certain industrial uses and a few clini- 
cal applications, their use is extensive. Whether testing contributes to 
stability (lack of change) or is simply a reflection of its existence in 
our society is a moot point -- in either case it should be quite clear that 
large-scale testing programs -- college entrance examination, graduate 
record examination, elementary and secondary school placement tests, military 
testing, civil service testing -- put a premium on behavior which is closely 
tied to criteria which were established on individuals who have already 
'made it, 1 Equally important is the efficiency and format of the testing 
model — take as little time as possible to make as big a prediction as is 
possible -- to make a prediction about a big thing that will take a long 
time. But this tells us something about who is to be the successful proto- 
type — someone who can show himself (maybe four years worth) in 15 minutes 
or two hours. 

This study of the CRB shows that small samples of classroom behavior 
(30 minutes) are not predictors of other small samples of classroom behavior 
and, it follows, are not predictors of test behaviors. In other words, our 
evidence points out that measured classroom functioning is much more variable 
than measured test functioning. It can be argued that the consistency of 
tests is a function of their structure -- all children are asked the same 
questions in the same way — and that the i neons i s tency of classroom behavior 
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is attributable to the measurement technique — watch the child doing any- 
thing he happens to be doing. But this is precisely what the game of change 
is all about. If structures — the what‘s and how's of questions or class- 
rooms -- are held constant, there will be no change either over time or 
between conditions. Classroom behavior has characteristics which make it 
much more important, if not uniquely so, for change, than tests. One of 
these characteristics — instability -- can be seen, at the same time, as 
a measurement headache, but also as a vital source of information about 
possible change, rather than about probable stability — the usual standard 
error of a test score. 

There are, then, a set of cha rac ter i s t i cs which are needed if we are 
to focus our attention on measurements which will facilitate change, rather 
than insuring a status quo. We have to be reasonably certain that quanti- 
tative and qualitative data can be communicated -- that operational defini- 
tions connect concepts and measurement to the mutual satisfaction of practi- 
tioners (in this case, teachers) and researchers. Legitimate instability 
cannot be sacrificed in the name of either reliability or convenience (money, 
time, effort, semester schedules, traditions of research funding.) 

In the, CRB we have developed an instrument which has passed some of 
the tests (sic) but which has a long way to go. We have continually involved 
teachers and parents in the development of the scales, including the choice 
of scales, verbalization, observational procedures and i nterpreta t ion . A 
heterogeneous group of observers, including non-degree community people, 
teachers and doctoral students, have observed children together and commu- 
nicated about behavior using CRB terminology. This agreement has been 
empirically verified by the observer agreement studies. 

The CRB is not correlated with traditional test measurements, although 
this would appear partly a function of insufficient observational data. 
However, it is clear that we can get reliable (observer agreement) data from 
observing classroom behaviors which are not associated with test behavior. 
There are many ways in which the connections between classroom and test 
behaviors can be explored — a search which is sorely needed. The fact that 
tli i s connection has not been made, nor has it hardly even been looked into, 
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is an insidious comment on research design and strategies. The questions 
that are being asked, or so i t would appear, are about what goes into the 
box and what comes out -- not what goes on inside. This clearly relegates 
process to insignificance. The process will continue to be ignored as long 
as psychometric canons prevail. Stick an 1 .Q. test with a dozen clinical 
reports and it will win out. Observational and clinical data will, in 
general, be obscured by tests and categories in any study which includes 
both, unless the tests and categories produce trivial results. The default 
of process means that process is given low order priority in the game of change 
-- for, if process were thought to be. important to change, we would certainely 
be more interested in it. Or perhaps it should be stated the other way: 
if we were more resolutely interested in change, process would be our principal, 
and perhaps unique, concern. 

We might begin asking questions about how we can change process, rather 
than how we can change individuals. This depends on whether we think changes 
in process can produce changes in individuals -- which does not mean that 
either process or individual transformations are mechanical, but that they 
take place in myriad ways, including mechanical. It was in terms of this 
line of inquiry that CRB scales were developed, applied and analyzed. But 
we have obviously only skimmed the surface of classroom process study. In 
other projects, we have filmed and video-taped teachers using a task structure 
-- each teacher does the same task so that many teachers can be compared in 
terms of process with task and materials held constant (Garfunkel, 1967b); 
participant observation methodology has been used to study how teachers deal 
with disturbance (Keane, 1969) and to compare experiences of black children 
who are bused to suburbs with those who stay in their inner city schools 
(Garfunkel, 1967a). The latter project used films as the principal media 
for presenting the final report (Garfunkel and Fiering, 1968) because the 
- focus of that report was on process. 

Vie see a need to simultaneously study process in a variety of ways, 
and to expend considerable energy on the development of vehicles for final 
reporting that will be appropriate to the subject matter and the audience. 

It is not at all clear that Ficherian statistics can do the job. Appropriate 
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mathematical models will have to be developed in order to deal with class- 
room realities, which are not at all 1 ike test realities. We certainely 
came nowhere near describing stable functions -- they are obviously more 
complex than univariate analyses will allow. It is difficult for genera- 
tions of researchers steeped in analysis of variance, correlation, factor 
analysis and true scores to realize that there are grave discrepancies 
between our biologically derived experimental models and human learning in 
social situations. As we struggled with CRB data, the inappropriateness of 
many traditional analytical techniques became more and more obvious. There 
were times (more often than we would like to admit) when our analyses fitted 
the program library at our excellent computer center, which have been developed 
because of design and methodological traditions. But it became more and more 
obvious, as we got into our data, that it did not fit these traditions* 

Recent work by Cronbach and Snow (1969) takes a view of learning which is 
similar to ideas behind CRB development. As they found out, as is painfully 
obvious in this report, the schedules of time, effort and monies will have 
to be drastically revised in order to adequately describe i ns true t iona 1 
variables in close and continuing connection with individual pupil differences 
over time. But this also requires serious changes in school practices, which 
have developed as if instructional variables and their interaction with indi- 
vidual pupil differences are of negligible importance. We refer here to 
policies and practices relating to teacher training, hiring of teachers, 
supervision, testing of children, curriculum development, placement of 
children in classes and promotion policies. 

Perhaps the most important innovation which would follow from atten- 
tion to process would be the necessary concomm i ttan t opening up of systems. 
Derogation of process is perfectly consistent with the closed character- 
istic of most educational systems -- if process does not make too much dif- 
ference, than, obviously, it does not have to be open for inspection, super- 
vision or research. The book-keeping process cannot look at everything, so 
it keeps track of that which is most highly valued. Without drawing too many 
conclusions at the same time, it seems fairly evident that there are important 
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connections between values, programming and evaluation. Work on the CRB 
is an expression of values which would affect programming and how we view 
changing the educational order of perceptions. 
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APPENDIX 



KEY FOR SCALES OF THE CLASSROOM BEHAVIOR FORM (CRB) 



1 6 2 CONTROL; Overt & covert: Who controls the child's activities? 

1. External; Teacher, other child or group; with direction and 
i ns i s tence. 

2. External; Direction, suggestion without insistence. 

3. A1 ternat i ves presen ted by teacher or other children. 

4. External- I nternal collaboration: Teacher-child or child-child 

discussion and participation in decisions. 

5. Internal: Child controls his activities. 



3. CURRICULUM: 

Classification 
1 Activity 



2 Substantive 



3 Routines 



o 

ERIC 



Categor i es 

11 Construction (wood, paper, clay, 
blocks, painting, collage, cook- 
ing, etc.) 

12 Performi ng (music, role playing, 
puppets, housekeeping, games, etc 

13 PI ay (water, sand, swing, jungle 
gym, jumping, etc.) 

1 b Undef i ned (wandering, physically 
or otherwise) 

15 Interval 

16 Social Interaction Situations 
(casual play, 1 augh i ng , talking , 
arguing, fighting, etc.) 

21 Science (biology, physics, 
chemistry, botany, zoology, etc.) 

22 Quan t i tat i ve (numbers, sizes, 
shapes, puzzles) 

23 Language (speaking, discussion, 
stories, writing, reading) 

2b Social Relations (interpersonal 
relations, culture, social organi 
zation, history, community, etc.) 

31 Snack (juice, lunch) 

32 Clean Up (of room, desks, tables, 
sink, etc.) 

33 Res t 

3b Other (arrival, departure, toilet 
ing, washing, dressing, etc.) 

40 
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k. PROCESS FOCUS - FORM: At what level is the form (structure) of 

the activity? How does it take place? 

1. Traditional, authoritarian, forces: Repetition, telling, 

direct questions, textual presentations, fighting. 

2. (intermediary point) 

3. Demonstration, showing and illustration. 

4. (Intermediary point) 

5. Experimental, systematic procedures, planning, deduction, 
induction, explanations, with involvement and participation 
of child. 

5. PROCESS FOCUS - CONTENT: At what level is the content? What is 

taking place? 

1. Mechanical: Labels, definitions, memorization, recitation. 

2. Skills: Motor, verbal. 

3. Precepts: Visual, auditory, tactual discriminations. 

4. Concepts: Ideas, issues. 

5. Trans forma t i ons : Relationships between concepts, situations, 

events v , i mp 1 i ca t i ons , analyses. 

6. PARTICIPATION: Involvement. How is the child participating in 

what he is doing? 

1. Rejection, Reluctance. 

2 . I nd i f ference . 

3. Mild I nvol vement . 

4. Moderate Involvement. 

5. Intense Involvement. 

7. BEHAVIOR: Activity level. How active is the child? 

1. Very low, non- respons i ve , withdrawn. 

2. Low passive. 

3. Average. 

4. Moderately active. 

5. High: Acting out, aggressive. 

8. INTERACTION. Agents. 

0. None 

1. Teacher and child(ren) 

2. Child and child(ren). 

9. INTERACTION: Type (between child and other children or teacher 

and chi Id.) 

0. None 

1. Parallel play or work. 

2. (intermediary point) 

3. Non-verbal interaction. 

4. (intermediary point) 

5. Verbal interaction. 
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INTERACTION: Mode of Interactions, 

0. None 

1. Negative indication, fighting, restriction, disagreement, 
des t ruct i ve. 

2. (Intermediary point) 

3. Neutral , 

4. ( 1 ntermed i ary point) 

5. Positive indication, supporting, sharing, reassuring. 



11. USE OF MATERIALS: How teacher or child uses materials, equipment, 

games, facilities, people, situations. 

1 . ! rre 1 evant . 

2. Prosaic, repetitive. 

3. Conventional, routine. 

b. Appropriate, but with some diversity. 

5. Unique, resourceful, creative, unconventional. 



12. ROLE OF CHILD: 

1 . Spectator , 

2. (intermediary point) 

3. Variable. 

(intermediary point) 

5. Active participant, 

13. GROUP SIZE: (include teacher) Number of individuals working or 

playing together. If group is clearly involved in 
common activity, circle number. 

1^. APPROPRI ATENESS : Observer's subjective reaction to child's 

behavior or to interaction. 

1 . I nappropr i ate . 

2. Neutral. (Does not have to be recorded.) 

3. Appropriate. 
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